ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors

  • Chuong B Do
  • Mahathi S P Mahabhashyam
  • Michael Brudno
  • Serafim Batzoglou
Abstract

To study gene evolution across a wide range of organisms, biologists need accurate tools for multiple sequence alignment of protein families. Obtaining accurate alignments, however, is a difficult computational problem because of not only the high computational cost but also the lack of proper objective functions for measuring alignment quality. In this paper, we introduce probabilistic consistency, a novel scoring function for multiple sequence comparisons. We present ProbCons, a practical tool for progressive protein multiple sequence alignment based on probabilistic consistency, and evaluate its performance on several standard alignment benchmark data sets. On the BAliBASE, SABmark, and PREFAB benchmark alignment databases, ProbCons achieves statistically significant improvement over other leading methods while maintaining practical speed. ProbCons is publicly available as a Web resource.

Similar resources

PROBCONS: Probabilistic Consistency-Based Multiple Alignment of Amino Acid Sequences

Obtaining an accurate multiple alignment of protein sequences is a difficult computational problem for which many heuristic techniques sacrifice optimality to achieve reasonable running times. The most commonly used heuristic is progressive alignment, which merges sequences into a multiple alignment by pairwise comparisons along the nodes of a guide tree. To improve accuracy, consistency-based ...

MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities

MOTIVATION: Multiple sequence alignment is of central importance to bioinformatics and computational biology. Although a large number of algorithms for computing a multiple sequence alignment have been designed, the efficient computation of highly accurate multiple alignments is still a challenge. RESULTS: We present MSAProbs, a new and practical multiple alignment algorithm for protein sequenc...

SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures

MOTIVATION: Multiple sequence alignment is an essential part of bioinformatics tools for a genome-scale study of genes and their evolution relations. However, making an accurate alignment between remote homologs is challenging. Here, we develop a method, called SPEM, that aligns multiple sequences using pre-processed sequence profiles and predicted secondary structures for pairwise alignment, co...

Multiple Sequence Alignment Tools: Assessing Performance of the Underlying Algorithms

Multiple sequence alignments have a primary role in several domains of modern molecular biology such as protein 3D structure/function prediction, phylogeny inference, molecular function, intermolecular interactions and many other common tasks in sequence analysis. Presently, many tools to construct multiple sequence alignments are available but none of them is accurate for all types of data sets....

PROMALS: towards accurate multiple sequence alignments of distantly related proteins

MOTIVATION: Accurate multiple sequence alignments are essential in protein structure modeling, functional prediction and efficient planning of experiments. Although the alignment problem has attracted considerable attention, preparation of high-quality alignments for distantly related sequences remains a difficult task. RESULTS: We developed PROMALS, a multiple alignment method that shows promi...

Journal:
  • Genome Research

Volume 15, Issue 2

Pages: -

Publication date: 2005